A note on the validity of cross-validation for evaluating autoregressive time series prediction

Authors

  • Christoph Bergmeir
  • Rob J. Hyndman
  • Bonsoo Koo
Abstract

One of the most widely used standard procedures for model evaluation in classification and regression is K-fold cross-validation (CV). However, when it comes to time series forecasting, because of the inherent serial correlation and potential non-stationarity of the data, its application is not straightforward and often omitted by practitioners in favour of an out-of-sample (OOS) evaluation. In this paper, we show that in the case of a purely autoregressive model, the use of standard K-fold CV is possible as long as the models considered have uncorrelated errors. Such a setup occurs, for example, when the models nest a more appropriate model. This is very common when Machine Learning methods are used for prediction, where CV in particular is suitable to control for overfitting the data. We present theoretical insights supporting our arguments. Furthermore, we present a simulation study and a real-world example where we show empirically that K-fold CV performs favourably compared to both OOS evaluation and other time-series-specific techniques such as non-dependent cross-validation.
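
The comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' experimental setup. It assumes a linear AR(p) model fitted by ordinary least squares, a simulated AR(2) series, randomly assigned folds, and a final 20% block for the OOS evaluation. All function names (`embed`, `fit_ar`, `kfold_cv_mse`, `oos_mse`) are hypothetical helpers introduced here. The key step is embedding the series into rows of lagged values, so that each row can be treated like an ordinary regression sample.

```python
import numpy as np

def embed(series, p):
    """Build lagged rows: X[i] = (y[t-1], ..., y[t-p]) with target y[t], t = p + i."""
    y = np.asarray(series, dtype=float)
    X = np.column_stack([y[p - k : len(y) - k] for k in range(1, p + 1)])
    return X, y[p:]

def fit_ar(X, y):
    """Least-squares fit of an AR model with intercept (an assumed model choice)."""
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef

def predict_ar(coef, X):
    return coef[0] + X @ coef[1:]

def kfold_cv_mse(X, y, k=5, seed=0):
    """Standard K-fold CV on the embedded rows, with randomly assigned folds."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), k)
    errors = []
    for test_idx in folds:
        train = np.ones(len(y), dtype=bool)
        train[test_idx] = False
        coef = fit_ar(X[train], y[train])
        errors.append(np.mean((y[test_idx] - predict_ar(coef, X[test_idx])) ** 2))
    return float(np.mean(errors))

def oos_mse(X, y, test_frac=0.2):
    """Out-of-sample evaluation: fit on the first block, test on the last block."""
    split = int(len(y) * (1 - test_frac))
    coef = fit_ar(X[:split], y[:split])
    return float(np.mean((y[split:] - predict_ar(coef, X[split:])) ** 2))

# Simulated stationary AR(2) series with unit-variance Gaussian noise (assumed example data).
rng = np.random.default_rng(42)
n = 600
series = np.zeros(n)
for t in range(2, n):
    series[t] = 0.5 * series[t - 1] - 0.3 * series[t - 2] + rng.normal()

X, y = embed(series, p=2)
cv_mse = kfold_cv_mse(X, y, k=5)
holdout_mse = oos_mse(X, y, test_frac=0.2)
print("5-fold CV MSE:", cv_mse)
print("OOS MSE:", holdout_mse)
```

Because the fitted order matches the simulated process, the residuals are approximately uncorrelated, which is the condition the abstract names for K-fold CV to be applicable. Both estimates should then land near the noise variance, with the CV estimate drawing on every embedded row rather than only the final block.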

Similar resources

Functional-Coefficient Autoregressive Model and its Application for Prediction of the Iranian Heavy Crude Oil Price

Time series and their methods of analysis are important subjects in statistics. Most time series behave linearly and can be modelled by linear ARIMA models. However, some realized time series behave nonlinearly, and modelling them requires nonlinear models. For this purpose, many good parametric nonlinear models, such as the bilinear model, exponential autoregressive model, threshold...

Availability Prediction of the Repairable Equipment using Artificial Neural Network and Time Series Models

In this paper, availability, one of the most important criteria of public service quality, is evaluated using an artificial neural network (ANN). In addition, availability values are predicted for future periods using an exponentially weighted moving average (EWMA) scheme and several time series models (TSM), including autoregressive (AR), moving average (MA) and autoregressive moving avera...

SVM-Based Time Series Prediction with Nonlinear Dynamics Methods

A key problem in time series prediction using autoregressive models is fixing the model order, namely the number of past samples required to model the time series adequately. Estimating the model order using cross-validation is a long process. In this paper we explore a faster alternative to cross-validation, based on nonlinear dynamics methods, namely Grassberger-Procaccia, Kégl and False...

A Note on the Validity of Cross-Validation for Evaluating Time Series Prediction

One of the most widely used standard procedures for model evaluation in classification and regression is K-fold cross-validation (CV). However, when it comes to time series forecasting, because of the inherent serial correlation and potential non-stationarity of the data, its application is not straightforward and often omitted by practitioners in favor of an out-of-sample (OOS) evaluation. In ...

Prediction of Above-elbow Motions in Amputees, based on Electromyographic (EMG) Signals, Using Nonlinear Autoregressive Exogenous (NARX) Model

Introduction: In order to improve the quality of life of amputees, biomechatronic researchers and biomedical engineers have been trying to use a combination of various techniques to provide suitable rehabilitation systems. Diverse biomedical signals, acquired from a specialized organ or cell system, e.g., the nervous system, are the driving force for the whole system. Electromyography (EMG), as a...

Journal:
  • Computational Statistics & Data Analysis

Volume 120

Published 2018